NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery

Mall, Utkarsh; Phoo, Cheng Perng; Chiquier, Mia; Hariharan, Bharath; Bala, Kavita; Vondrick, Carl (June 2025, CVPR)

Full Text Available
Self-Improving Autonomous Underwater Manipulation

https://doi.org/10.1109/ICRA55743.2025.11128759

Liu, Ruoshi; Ha, Huy; Hou, Mengxue; Song, Shuran; Vondrick, Carl (May 2025, IEEE)

Full Text Available
Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities

Menon, Sachit; Zemel, Richard; Vondrick, Carl (November 2024, EMNLP)

Full Text Available
Evolving Interpretable Visual Classifiers with Large Language Models

https://doi.org/10.1007/978-3-031-73039-9_11

Chiquier, Mia; Mall, Utkarsh; Vondrick, Carl (October 2024, Springer Nature Switzerland)

Full Text Available
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation

Liang, Junbang; Liu, Ruoshi; Ozguroglu, Ege; Sudhakar, Sruthi; Dave, Achal; Tokmakov, Pavel; Song, Shuran; Vondrick, Carl (November 2024, CoRL)

Full Text Available
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis

Van_Hoorick, Basile; Wu, Rundi; Ozguroglu, Ege; Sargent, Kyle; Liu, Ruoshi; Tokmakov, Pavel; Dave, Achal; Zheng, Changxi; Vondrick, Carl (September 2024, European Conference on Computer Vision)

Full Text Available
Seeing Science: Inquiry-Based Learning at Home Through Mobile Messaging System

https://doi.org/10.1145/3628516.3659396

Fuhrmann, Tamar; Lemee, Marina A; Pang, Jonathan; You, Je Seung; Chilton, Lydia B; Vondrick, Carl; Blikstein, Paulo (June 2024, ACM)

Full Text Available
Fully body visual self-modeling of robot morphologies

https://doi.org/10.1126/scirobotics.abn1944

Chen, Boyuan; Kwiatkowski, Robert; Vondrick, Carl; Lipson, Hod (July 2022, Science Robotics)

A robot can learn full-body morphology via visual self-modeling to adapt to multiple motion planning and control tasks.
more » « less
Full Text Available
Listening to Sounds of Silence for Speech Denoising

Xu, Ruilin; Wu, Rundi; Ishiwaka, Yuko; Vondrick, Carl; Zheng, Changxi (December 2020, Advances in neural information processing systems)

We introduce a deep learning model for speech denoising, a long-standing challenge in audio analysis arising in numerous applications. Our approach is based on a key observation about human speech: there is often a short pause between each sentence or word. In a recorded speech signal, those pauses introduce a series of time periods during which only noise is present. We leverage these incidental silent intervals to learn a model for automatic speech denoising given only mono-channel audio. Detected silent intervals over time expose not just pure noise but its time-varying features, allowing the model to learn noise dynamics and suppress it from the speech signal. Experiments on multiple datasets confirm the pivotal role of silent interval detection for speech denoising, and our method outperforms several state-of-the-art denoising methods, including those that accept only audio input (like ours) and those that denoise based on audiovisual input (and hence require more information). We also show that our method enjoys excellent generalization properties, such as denoising spoken languages not seen during training.
more » « less
Full Text Available
Visual Hide and Seek

https://doi.org/10.1162/isal_a_00269

Chen, Boyuan; Song, Shuran; Lipson, Hod; Vondrick, Carl (January 2020, The 2020 Conference on Artificial Life)

We train embodied agents to play Visual Hide and Seek to study the relationship between agent behaviors and environmental complexity. In Visual Hide and Seek, a prey must navigate in a simulated environment in order to avoid capture from a predator, only relying on first-person visual observations. By probing different environmental factors, agents exhibit diverse hiding strategies and even the knowledge of its own visibility to other agents in the scene. Furthermore, we quantitatively analyze how agent weaknesses, such as slower speed, affect the learned policy. Our results suggest that, although agent weakness makes the learning problem more challenging, they also cause more useful features to be learned.
more » « less
Full Text Available

Search for: All records